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DNA POLYMERASE HAVING ABILITY TO REDUCE INNATE 
SELECTIVE DISCRIMINATION AGAINST 
FLUORESCENT DYE-LABELED D I DEOXYNUCLEOT I DES 

BACKGROUND OF THE INVENTION 
lis application is a continuation-in-part 
application of Serial No. 08/544,643 (now U.S. 
Patent 5,7^,298), filed October 18, 1995, and 
08/642,684, fUed May 3, 1996, and the entire 
contents of bo\ appxi-cations are incorporated 
herein by reference. 

The genetic material of all known living 
organisms is deoxyribonucleic acid (DNA) , except 
in certain viruses whose genetic material may be 
ribonucleic acid (RNA) . DNA consists of a chain of 
individual deoxynucleotides chemically linked in 
specific sequences. Each deoxynucleotide contains 
one of the four nitrogenous bases which may be 
adenine (A) , cytosine (C) , guanine (G) or thymine 
(T), and a deoxyribose, which is a pentose, with a 
hydroxyl group attached to its 3' position and a 
phosphate group attached to its 5' position. The 
contiguous deoxynucleotides that form the DNA 
chain are connected to each other by a 
phosphodiester bond linking the 5' position of one 
pentose ring to the 3' position of the next 
pentose ring in such a manner that the beginning 
of the DNA molecule always has a phosphate group 
attached to the 5' carbon of a deoxyribose. The 
end of the DNA molecule always has an OH 
(hydroxyl) group on the 3' carbon of a 
deoxyribose . 

DNA usually exists as a double -stranded 
molecule in which two antiparallel DNA strands are 
held together by hydrogen bonds between the bases 
of the individual nucleotides of the two DNA 
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strands ilR strictly matched -A-T " and »C-G» 
pairing manner. It is the order or sequence of the 
bases in a strand of DNA that determines a gene 
which in turn determines the type of protein to be 
synthesized. Therefore, the accurate determination 
of the sequence of the bases in a DNA strand which 
also constitutes the genetic code for a protein is 
of fundamental importance in understanding the 
characteristics of the protein concerned. 

The process used to determine the 
sequence of the bases in a DNA molecule is 
referred to as DNA sequencing. Among the 
techniques of DNA sequencing, the enzymatic method 
developed by Sanger et al. (1) is most popular. It 
is based on the ability of a DNA polymerase to 
extend a primer annealed to the DNA template to be 
sequenced in the presence of four normal 
deoxynucleotide triphosphates (dNTPs) , namely, 
dATP, dCTP, dGTP and dTTP, and on the ability of 
the nucleotide analogs, the dideoxynucleotide 
triphosphates (ddNTPs) , namely, ddATP, ddCTP, 
ddGTP and ddTTP, to terminate the extension of the 
elongating deoxynucleotide polymers at various 
lengths . 

In the classic one -step Sanger method, 
the sequence determination is carried out in a set 
of four separate tubes, each containing all four 
normal dNTPs, one of which is labeled with a 
radioactive isotope, 32 P or 35 S, for 
autoradiographic localization, a limiting amount 
of one of the four ddNTPs, a DNA polymerase, a 
primer, and the DNA template to be sequenced. As a 
result of the DNA polymerase activity, individual 
nucleotides or nucleotide analogs are added to the 
new DNA chains, all starting from the 3' end of 
the primer in a 5' -3' direction, and each linked 
to adjacent ones with a phosphodiester bond in a 
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base sequCTice complementary to the DNA sequence of 
the template. Inasmuch as there is a nucleotide 
analog in the reaction mixture, each tube 
eventually contains numerous newly formed DNA 
5 strands of various lengths, all ending in a 

particular ddNTP, referred to as A, C, G or T 
terminator. 

After resolving the four sets of 
reaction products by high-resolution 

10 polyacrylamide/urea gel electrophoresis, the 

populations of the newly formed DNA strands are 
separated and grouped according to their molecular 
weight. An autoradiographic image of the gel will 
show the relative positions of these DNA strands 

15 as bands which differ from one another in distance 

measured by one nucleotide in length, all sharing 
an identical primer and terminating with a 
particular ddNTP (A, C ,G or T) . By reading the 
relative positions of these bands in the "ladder" 

2 0 of the autoradiograph, the DNA sequence of the 

template can be deduced. 

The DNA polymerase used in the reaction 
mixture plays a pivotal role in DNA sequencing 
analysis. To be useful for DNA sequencing, a DNA 
25 polymerase must possess certain essential 

properties. For example, it must have its natural 
5' -3' exonuclease activity removed by mutagenesis 
or by posttranslational modification, such as° 
enzymatic digestion, and must be able to 

3 0 incorporate dNTPs and ddNTPs, without undue 

discrimination against ddNTP and with a 
sufficiently high processivity which refers to the 
ability of the enzyme to polymerize nucleotides 
onto a DNA chain continuously without being 
35 dislodged from the chain, and a sufficiently high 

elongation rate. A 5' -3' exonuclease activity 
associated with a DNA polymerase will remove 
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nucleotft from the primer, thus cause a 
heterogeneous 5' end for the newly formed DNA 
strands, resulting in a false reading of the 
strand lengths on the sequencing gel. A DNA 
polymerase with a low processivity and a low 
elongation rate will cause many undesirable noise 
background bands of radioactivity due to the 
presence of DNA strands which are formed with 
improper lengths and improper terminations. Among 
the more commonly used DNA polymerases, Sequenase" 
has a higher processivity and a higher elongation 
rate than others, such as the Klenow fragment, 
Taq, and Vent polymerases (2), and is therefore 
one of the most popular DNA polymerase selected 
for DNA sequencing to-date. 

However, even when a DNA polymerase has 
been endowed with all the essential properties 
listed above, it may still generate erroneous or 
misleading band patterns of radioactivity in the 
20 sequencing gel. These artif actual patterns do not 

faithfully reflect the true nucleotide sequence in 
the template being sequenced. They may be caused 
by premature termination of the elongating strands 
due to the presence of secondary structures formed 
along the template, such as hairpins » in the 
regions that contain palindromic sequences or that 
are rich in G and C bases (3); or, they may occur 
as a result of inadequate proof-reading » 
function of the DNA polymerase that will allow the 
removal of misincorporated nucleotides at the 3' 
end of an elongating strand. 

Researchers in the field of DNA 
sequencing often have to use several approaches to 
confirm their findings in order to avoid being 
misled by these potentially erroneous sequence 
data. For example, they sometimes rely on 
repeating the same sequencing experiment with 
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different^NA polymerases, or performing another 
sequencing reaction with the template which is 
complementary to the first single-stranded DNA 
template, and compare the results for possible 
5 discrepancies. 

Numerous investigators have tried to 
find an ideal DNA polymerase for enzymatic 
sequencing, i.e. an enzyme that not only has all 
the essential properties required for sequencing 
10 reaction, but also is capable of resolving the 

secondary hairpin structures and preventing the 
formation of strands containing nucleotides non- 
complementary to those of the template being 
sequenced. 

15 The discovery by Ye and Hong (4) of the 

thermostable large fragment of DNA polymerase 
isolated from Bacillus stearothermophilus (Bst) , 
an enzyme that is functional over the temperature 
range between 25°C and 75°C, but is most active at 

20 65°C, and possesses all the essential properties 

for DNA sequencing, has largely solved the problem 
caused by secondary structures in the template 
since these secondary structures are destabilized 
when the sequencing reaction is carried out at 

25 65°C. In the past few years since this enzyme was 

made commercially available under the name of Bst 
DNA Polymerase (Bio-Rad Laboratories) , independent 
reports have confirmed that during sequencing 
reaction catalyzed by this enzyme all four dNTPs, 

30 including dCTP> and other nucleotide analogs, such 

as dITP and 7-deaza-dGTP, are incorporated equally 
effectively in the chain elongation, , thus 
eliminating the weak "C" band phenomena often 
observed when other DNA polymerases are used, and 

35 producing a very good band uniformity on the 

sequencing gel. It has been further established 
that at this elevated temperature Bst DNA 
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pqlymerasWsystem can be used both for - the classic 
Sanger one -step reaction as well as for the 
"labeling/termination 11 sequencing reaction, 
double -stranded DNA sequencing, and the 
incorporation of 35 S-labeled nucleotides, and 32 P- 
labeled nucleotides. Since this system can be 
placed at room temperature for at least two weeks 
without significant loss of its enzymatic 
activity, it has been adapted for automation of 
DNA sequencing which requires a stable DNA 
polymerase, using either fluorescent dye or 
radioactive isotope labeling. (See also 9, 12, and 
13.) 

However, when this Bst enzyme is used 
for automated fluorescent DNA sequencing, only 
partially satisfactory results have been obtained 
with fluorescent dye-labeled primers (see 12 and 
EG Bulletin 1771 of Bio-Rad Laboratories) , and 
even less satisfactory results are obtained with 
fluorescent dye-labeled ddNTP terminators . Even 
when fluorescent dye- labeled primers are used, a 
significant number of mismatched ddNTPs are 
incorporated onto the 3' end of the extending 
nucleotides in the enzymatic reaction, thus 
generating erroneous sequencing data (see Bio-Rad 
EG Bulletin 1771) . With this in mind, the 
inventors sought, and found, a better DNA 
polymerase for DNA sequencing, especially for 
automated fluorescent dye- labeled primer and 
fluorescent dye- labeled terminator sequencing. 

Another disadvantage of the Bst DNA 
polymerase currently known in the art is its lack 
of 3' -5' exonuclease activity (5), and 
specifically, proof-reading 3' -5' exonuclease 
activity. A survey of the sequencing data 
collected from fourteen research centers which 
have used this Bst DNA polymerase for their DNA 



sequencif^T work on over 120 DNA clones showed 
that, statistically, base pair mismatching occurs 
at a rate of about 1.5 x 10" 5 . That is, 
approximately 1 . 5 errors can be expected in one 
hundred thousand nucleotide incorporations during 
nucleotide polymerization catalyzed by the enzyme. 

It is generally known that the formation 
of incorrect DNA sequences due to mismatching of 
base pairs between the template and the growing 
nucleotide chain in DNA sequencing may be 
prevented by a 3' -5' exonuclease activity which 
"proof-reads" the nucleotide chain. However, even 
if a DNA polymerase exhibits 3' -5' exonuclease 
activity in vitro, it is often the case that the 
polymerase will not adequately "proof-read" . 
Thus, the polymerase will not be capable of 
removing mismatched nucleotides from a newly 
formed DNA strand as efficiently as those 
nucleotides correctly matched with the nucleotides 
of the template. In other words, a 3 / -5 / 
exonuclease may excise the correctly matched 
nucleotides at a faster rate than the mismatched 
ones from the 3' terminus, or excise both the 
correctly matched and the mismatched nucleotides 
at the same rate. Consequently, even where the 
DNA polymerase has 3' -5' exonuclease activity, it 
does not perform any useful proof-reading function 
during DNA polymerization. 

It is also known that a 3' -5' 
exonuclease activity associated with a DNA 
polymerase, in the presence of low concentrations 
of dNTPs, often counteracts the normal chain 
elongation processl catalyzed by the polymerase, 
induces cyclic incorporation and degradation of 
nucleotides over the same segment of template, or 
even operates more efficiently than the polymerase 
activity per se, to the extent of causing 



degradation of the primer. Consequently, removal 
of the 3' -5' exonuclease activity along with the 
5' -3' exonuclease activity from the native DNA 
polymerases by chemical means or by genetic 
engineering techniques has become a standard 
procedure in producing DNA polymerases for 
sequencing. This is a common strategy to preserve 
the essential properties of a DNA polymerase. 

For example, among the major 
commercially available sequencing enzymes (other 
than the native Taq (Thermus aquaticus) DNA . 
polymerase which lacks a 3' -5' exonuclease 
activity de novo) the 3' -5' exonuclease activity 
has been removed from the native T7 DNA 
polymerase, which lacks a 5' -3' exonuclease, 
either by a chemical reaction that oxidizes the 
amino acid residues essential for the exonuclease 
activity (Sequenase™ Version 1) or genetically by 
deleting 28 amino acids essential for the 3 '-5' 
exonuclease activity (Sequenase™ 2) . 

Vent R (exo~) DNA polymerase, which is 
recommended as the preferred form of the Vent DNA 
polymerase for sequencing, also has its 3' -5' 
exonuclease activity removed by genetic 
modification. The native Vent DNA polymerase and 
the Klenow fragment isolated from the native 
coli DNA polymerase I possess a 3' -5' exonuclease; 
but these enzymes are no longer considered the 
enzymes of choice for DNA sequencing. 

The currently known Bst DNA polymerase 
(e.g., produced by Bio-Rad Laboratories) isolated 
and purified from the cells of Bacillus 
stearothermophilus for DNA sequencing is free of 
3'-5' exonuclease activity (5). 

IsoTherm™ DNA Polymerase, a commercially 
available Bst DNA polymerase for DNA sequencing, 
marketed by Epicentre Technologies (1402 Emil 
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Street, MWTison, WI 53713), is also based on a Bst 
DNA polymerase whose 3' -5' exonuclease activity 
has been enzymatically removed (6) . 

Only the rBst DNA Polymerase produced 
from an over -expressing recombinant clone in E. 
coli , which is the product of the DNA pol I gene 
of Bacillus stearo thermophilics, possesses a 3' -5' 
exonuclease activity in addition to a 5' -3' 
exonuclease activity. However, due to the 
existence of an undesirable 5' -3' exonuclease 
activity and a 3' -5' exonuclease activity of 
unknown characteristics, the latter product is not 
recommended by the company for DNA sequencing (6) . 

Over the past 10 years there has been a 
trend to develop and improve the automated 
fluorescent DNA sequencing technology to replace 
the classic radioactive isotope labeling manual ^ 
method for DNA sequencing because of the potential 
harmful effects of the radioactive materials to 
humans and because of the need for automated high 
throughput DNA sequencing systems. In using 
fluorescent dyes as markers for labeling the DNA 
strands generated in enzymatic reactions for 
sequencing, the dyes can be either coupled with 
the primer, or coupled with the ddNTP terminators, 
namely the dye- labeled ddATP, dye -labeled ddCTP, 
dye -labeled ddGTP and dye -labeled ddTTP. 
Sequencing techniques based on these two forms of 
labeling of the final enzymatic reaction products 
are commonly referred to as "dye primer 
sequencing" and "dye terminator sequencing", 
respectively. 

In the dye primer sequencing, ddNTPs are 
employed as the chain terminators, as in the 
original classic Sanger method which uses 
radioactive isotope as the marker. The molecular 
structure of ddNTPs are almost identical to that 



of dNTPg^the natural building blocks of all DNA 
molecules. Therefore, any DNA polymerase which 
has been used for radioactive isotope manual DNA 
sequencing can be easily adapted for fluorescent 
dye primer DNA sequencing with equally 
satisfactory results. The disadvantage in the dye 
primer technology is that the primer for each 
template to be sequenced must be labeled with four 
different fluorescent dyes and that the enzymatic 
reaction must be performed in four separate test 
tubes each containing only one of the ddNTPs, 
namely ddATP, ddCTP, ddGTP or ddTTP, as in the 
classic Sanger radioisotope method. 

In the dye terminator technology for DNA 
sequencing, the fluorescent dye- labeled ddATP, 
dye-labeled ddCTP, dye-labeled ddGTP and dye- 
labeled ddTTP are coupled with different 
fluorescent dyes, each emitting a specific light 
spectrum, thus directly reporting the type of 
ddNTP at the 3' terminus of the DNA fragment. 
Unlike the situations in the dye primer technology 
in which four different fluorescent dyes are 
coupled to a primer incorporated into all newly 
formed DNA strands, these dye -labeled ddNTPs serve 
the dual function of a specific base terminator 
and a "color marker". There is no need to label 
the primer for each new template, and the 
polymerase DNA extension reaction can be performed 
in a single test tube to generate the required 
specifically terminated and specifically dye- 
labeled DNA fragments of various sizes for DNA 
sequencing . 

The advantage of using fluorescent dye- 
labeled terminators for DNA sequencing is obvious. 
However, there are certain difficulties to 
overcome before an enzymatic reaction system 
suitable for a radioisotope technique or suitable 



for a dy^^rimer technique can be adapted for a 
dye terminator technology. An increase of the 
molecular weight from less than 500 for a ddNTP 
terminator to about 800 or more for a fluorescent 
dye -labeled ddNTP terminator may be associated 
with potential three-dimensional structural 
changes. These molecular alterations may 
interfere with the process of incorporation of the 
dye-labeled ddNTPs as chain terminators by the DNA 
polymerase to the 3' end of an extending DNA 
strand in terms of lowering the rate of 
incorporation, lowering the processivity of the 
enzyme for this new substrate, reducing the 
enzyme -terminator binding specificity and changing 
the enzyme -terminator binding kinetics. 

For example, both Taq DNA polymerase and 
Sequenase II™ (a T7 DNA polymerase) have been used 
for radioisotope labeling DNA sequencing with 
excellent results, and have been adapted for 
fluorescent dye-labeled primer DNA sequencing. 
But neither can be used for fluorescent dye- 
labeled terminator DNA sequencing technologies. 
As reported in U.S. Patent 5,614,365, when the Taq 
DNA polymerase was used for fluorescent dye- 
labeled terminator chemical reactions, the 
reaction products generated no readable data on 
the DNA sequencer. Most of the fluorescence was 
either in unincorporated dye -ddNTPs at the leading 
front of the test gel, or in fragments greater 
than several hundred bases in length. Using a Taq 
DNA polymerase mutant In which the amino acid, 
phenylalanine, at position 667 of its amino acid 
sequence has been replaced by a tyrosine and which 
has an increased ability to incorporate 
dideoxynucleotides (6,000 times more efficient), 
to replace the unmodified Taq DNA polymerase for 
the experiment, the results are significantly 



improved .^^This F667Y mutant of Taq DKfT polymerase 
is now marketed by Amersham Life Science, Inc. 
under the trademark ThermoSequenase™. It is used 
for cycle -sequencing in which the enzymatic 
reaction mixture is subjected to numerous cycles 
of extension- termination, denaturing and annealing 
to ensure that sufficient dye-terminator-labeled 
enzymatic reaction products are generated for the 
DNA sequencing procedure. Because of the low 
processivity of the parent Taq DNA polymerase, 
ThermoSequenase™ is not recommended for direct DNA 
sequencing without precyclings. Like Taq DNA 
polymerase, ThermoSequenase™ lacks a proof-reading 
exomiclease activity. 

Bacillus stearothexmophilus, Bacillus 
caldotenax and Bacillus caldolyticus are 
classified as mesophilic microbes; although their 
DNA polymerases are referred to as thermostable 
(most active at 65 *C) they are inactivated at 70 *C 
or above. This is contrasted with other enzymes, 
such as Taq, which are truly thermophilic--that 
is, their DNA polymerases tolerate and remain 
active at temperatures higher that 95 *C. These 
mesophilic bacillus strains, especially Bacillus 
stearothermophilus, produce DNA polymerases that 
are useful in DNA sequencing applications. 
However, a disadvantage of the DNA polymerases of 
these strains is that during DNA sequencing they 
all exhibit a high degree of selective 
discrimination\ against incorporation of certain 
particular members of fluorescent dye-labeled 
ddNTPs, namely the fluorescent dye -labeled ddCTP 
and fluorescent dye- labeled ddATP, as terminators 
onto the 3' end of the extending DNA fragments 
during enzymatic reaction. This peculiar 
characteristic of selective discrimination against 
incorporation of fluorescent dye- labeled ddCTP and 



ddATP of 




natural DNA polymerases 




Elated from 



Bacillus stearothermophilus and Bacillus 
caldotenax was not previously recognized. Such 
selective discrimination is apparently sequence- 
related, and cannot be corrected or compensated by 
mere adjustment of the concentrations of the 
dNTPs . 



bacillus DNA polymerase that does not selectively 
discriminate against incorporation of fluorescent 
dye -labeled ddCTP and ddATP, during dye primer or 
dye terminator DNA sequencing. 



described problems associated with mesophilic 
bacillus DNA polymerases by providing novel DNA 
polymerases which, during direct DNA sequencing, 
reduce the innate selective discrimination against 
the incorporation of fluorescent dye-labeled ddCTP 
and fluorescent dye-labeled ddATP, without 
increasing the rate of incorporation of the other 
two dye- labeled ddNTP terminators (ddTTP and 
ddGTP) excessively. In particular, this invention 
provides a novel genetic modification of the amino 
acid sequence of a highly processive DNA 
polymerase (such as isolated from Bacillus 
stearothermophilus , Bacillus caldotenax or 
Bacillus caldolyticus) that, unmodified, 
selectively discriminates against incorporation of 
fluorescent dye -labeled dideoxynucleotide 
terminators ddATP and ddCTP (but does not 
discriminate against incorporation of fluorescent 
dye-labeled dideoxynucleotide terminators ddTTP 
and ddGTP) . The modification results in a 
reduction of the innate selective discrimination 
against incorporation of fluorescent dye- labeled 



Thus, there is a need for a mesophilic 
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This invention addresses the above- 
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otide terminators ddATP a: 




ddCTP, 



such that all four of the ddNTP terminators are 
effectively incorporated into the DNA primer 
elongated by the DNA polymerase. Thus, the 
modified DNA polymerase of this invention is 
effective in reducing the innate selective 
discrimination against incorporation of 
fluorescent dye-labeled dideoxynucleotide 
terminators ddATP and ddCTP characteristic of the 
DNA polymerase in its unmodified state. 



polymerase is a modification of a DNA polymerase 
isolated from a strain of a mesophilic bacterium, 
such as Bacillus s tea.ro thermophi lus , Bacillus 
caldotenax or Bacillus caldolyticus . The approach 



may be used to modify other DNA polymerases which 
share a close amino acid homology of a DNA 
polymerase isolated from a strain Bacillus 
s tearo thermophi lus , Bacillus caldotenax or 
Bacillus caldolyticus, as long as the unmodified 
DNA polymerases have a selective discrimination 
against incorporation of fluorescent dye-labeled 
dideoxynucleotide ddCTP and/or ddATP as 
terminators in the enzymatic reaction for 
preparing materials for automated fluroescent DNA 
sequencing. Consequently, it is preferred that 
the modified DNA polymerase has an amino acid 
sequence that shares not less than 95% homology of 
a DNA polymerase isolated from a strain of 
Bacillus s tearo thermophi lus , Bacillus caldotenax 
or Bacillus caldolyticus . 



species is Bacillus s tearo thermophi lus , which is 
highly heterogeneous. This is indicated by the 
wide range of DNA base compositions as well as the 
range of the phenotypic properties of strains 



In particular, the preferred DNA 



of modifying the DNA polymerase described herein 



The particularly preferred mesophilic 
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assigned 




this species (see Bergey 




[anual of 



Systemic Bacteriology, Eds. P.H.A. Sneath, N.S. 
Mair, M.E. Sharpe and J.G. Holt, Williams & 
Wilkins, 1986, Vol. 2, page 1135). Therefore, it 
is reasonable to assume that the amino acid 
sequences of DNA polymerases isolated from various 
strains would be heterogeneous with potential 
functional differences. Although DNA polymerases 
isolated from the known standard strains of 
Bacillus stearothermophilus have been shown to 
lack a 3' -5' exonuclease activity, a questionable 
trace of "contaminating" 3' 5' exonuclease has been 
observed in a purified DNA polymerase preparations 
(see Kaboev et al . , J. Bacteriology, Vol. 145, 
page 21-26, 1981) . 



address the above- identified problems in the art 
by discovering a strain of Bacillus 
stearothermophilus (designated strain No. 320 for 
identification purposes; described in U.S. 
5,747,298) that produces a DNA polymerase 
(designated Bst 320) with a proof-reading 3' -5' 
exonuclease activity which is absent in DNA 
polymerases isolated from other strains of 
Bacillus stearothermophilus . (For this invention, 
the term "proof-reading" is intended to denote 
that the DNA polymerase is capable of removing 
mismatched nucleotides from the 3' terminus of a 
newly formed DNA strand at a faster rate than the 
rate at which nucleotides correctly matched with 
the nucleotides of the template are removed during 
DNA sequencing.) The strain Bst 320 was deposited 
on October 30, 1995 in the American Type Culture 
Collection, located at 12301 Parklawn Drive, 
Rockville, Maryland 2 0852, and has been given ATCC 
Designation No. 55719. The DNA polymerase 
isolated from Bst 320 is composed of 587 amino 



Consequently, the inventors began to 
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3^^e the DNA polymerases of c^^i 



acids aspire the DNA polymerases of OCTrer known 
strains of Bacillus stearothermophilus , such as, 
for instance, the strains deposited by Riggs et al 
(Genbank Accession No. L42111) and by Phang et al . 
5 (Genbank Accession No. U23149) . However, the Bst 

320 shares only 89.1% sequence identity at protein 
level with the Bacillus stearothermophilus DNA 
polymerase deposited by Riggs et al . , and shares 
only 87.4% sequence identity at protein level with 
10 the Bacillus stearothermophilus DNA polymerase 

deposited by Phang et al . For comparison, the 
above -referenced enzyme deposited by Riggs et al . 
and the enzyme deposited by Phang et al . share 
96.9% of their amino acid sequence identity. 
15 The inventors studied a thermostable DNA 

polymerase isolated from a different species, 
Bacillus caldotenax (Bca) , which also has an 
optimum active temperature at 65 °C. The inventors 
discovered that the Bst 320 DNA polymerase shares 
20 88.4% of the amino acid sequence identity with Bca 

DNA polymerase (Uemori et al . J. Biochem. 113: 
401-410, 1993) . Based on homology of the amino 
acid sequences, Bst 320 DNA polymerase is as close 
S to DNA polymerases isolated from Bacillus 

?ea * 25 stearothermophilus as to the DNA polymerase 

isolated from Bacillus caldotenax, i.e. another 
species of bacillus. It was also discovered that 
both Bst 32 0 DNA polymerase and Bca DNA polymerase 
functionally exhibit 3' -5' exonuclease activity, 
30 which is not associated with known amino acid 

sequence exonuclease motifs I, II and III as in 
the E. coli DNA polymerase I model, or other known 
Bacillus stearothermophilus polymerases. 

The inventors has studied the DNA 
35 polymerases of three different strains of Bacillus 

stearothermophilus (including DNA polymerase 
obtained from Bst 320) and the DNA polymerase of 
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Bacillus ^aldotenax and found that they all 
exhibit a high degree of selective discrimination 
against incorporation of certain particular 
members of fluorescent dye -labeled ddNTPs, namely 
the fluorescent dye -labeled ddCTP and fluorescent 
dye-labeled ddATP, as terminators onto the 3' end 
of the extending DNA fragments during enzymatic 
reaction. This is especially the case when the 
preceding 3 ' end base of the extending DNA 
fragment is a dGMP (G) or a dAMP (A) . (By u dNTP" 
it is intended to denote the four commonly known 
deoxynucleotide triphosphates, dATP, dTTP, dCTP, 
and dGTP . ) 

This selective discrimination causes 
missing peaks and ambiguous peaks on a color plot 
generated by the automated fluorescent DNA 
sequencer, and causes loss of sequencing data and 
erroneous base callings. This is shown in Figures 
6 and 8 . 

This disadvantage of the natural 
bacillus DNA polymerases in fluorescent dye- 
labeled terminator DNA sequencing cannot be 
corrected or compensated by mere adjustment of the 
concentrations of the dNTPs and the fluorescent 
dye-labeled ddNTPs in the reaction mixture. This 
selective discrimination against the specific dye- 
labeled ddNTPs is also sequence-related as 
demonstrated with respect to Bst in Figures 6 and 
8, in which the missing or ambiguous "C" peaks and 
"A" peaks tend to occur immediately following a 
preceding "G" peak or a preceding "A" peak. Of 
particular interest is the fact that the "C" and 
"A" peaks immediately following a preceding "C" or 
a preceding "T" peak are quite strong and 
resolvable in the same color plot analysis, 
indicating that the concentrations of dNTPs and 
the fluorescent dye -labeled ddCTP and the 
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fluoresc^^ dye -labeled ddATP were adequate for 
the termination reaction. 

According to the structural model 
studies carried out on E. coli DNA polymerase I 
(Klenow fragment) , certain amino acids in a 
particular region or regions of a DNA polymerase 
appear to play important roles in dNTP and ddNTP 
bindings and their final incorporation, and affect 
discrimination between deoxy and dideoxynucleotide 
substrates. For example, mutation of the amino 
acids arginine, asparagine, lysine, tyrosine, 
phenylalanine, aspartate, and glutamate in certain 
locations of amino acid sequences of Klenow 
fragment may affect the binding of dNTP and 
discrimination between deoxy and 

dideoxynucleotides . (See: Joyce, CM., Current 
Opinion in Structural Biology, 1:123-129, 1991. 
Joyce and Steitz, Annu. Rev. Biochem. , 63:777-822, 
1993, page 800. Carrol et al . , Biochemistry 
30 :804-813, 1991) . 

The problem which faced the inventors 
was how to reduce the selective discrimination 
against the incorporation of fluorescent dye- 
labeled ddCTP and fluorescent dye- labeled ddATP by 
site-directed mutagenesis of a DNA polymerase, 
without increasing the rate of incorporation of 
the other two dye -labeled ddNTP terminators 
excessively. In particular, the new mutant must 
be able to incorporate more correctly base-matched 
dye- labeled ddCTP and/ or dye -labeled ddATP 
terminators to the dGMP (G) and dAMP (A) bases, 
than to the dCMP (C) and dTMP (T) bases of the 
extending DNA fragments during enzymatic reaction. 
A blanket increase in the ability of an enzyme to 
incorporate all four dye -labeled ddNTPs to the 
same proportion would serve no useful purpose for 
the group of DNA polymerases isolated from 
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mesophili 




tacilli since, unlike the 




DNA 



10 




in 
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polymerase, the unmodified natural enzymes of 
Bacillus stearothermophilus and Bacillus 
caldotenax already possess a high ability to 
incorporate fluorescent dye -labeled ddGTP and 
fluorescent dye-labeled ddTTP, and even the 
fluorescent dye-labeled ddCTP and dye-labeled 
ddATP provided at the immediately preceding base 
at the 3 'end of the extending DNA fragment is not 
a "G" or an "A" . 

The inventors found that DNA polymerases 
isolated from strains of Bacillus 

stearothermophilus and Bacillus caldotenax possess 
the same amino acids at certain specific positions 
in their amino acid sequence. For example, they 
all have leucine -glutamate-glutamate at positions 
corresponding to positions 342-344 and 
phenylalanine at a position 

corresponding to position 422 of the amino acid 
sequence of the DNA polymerase isolated from No 
320 strain of Bacillus stearothermophilus . The 
inventors further discovered that the most optimal 
modification to solve the problem of selective 
discrimination in direct fluorescent DNA 
sequencing for these DNA polymerases is to modify 
the four amino acids of the natural DNA 
polymerases referenced above in such a form that 
threonine -proline -leucine substitute respectively 
for leucine-glutamate-glutamate at positions 342- 
344 and tyrosine substitutes for phenylalanine at 
position 422 in their amino acid sequences. 
Accordingly, the nucleotide sequence encoding the 
natural forms of the DNA polymerases are modified 
at positions 1024-1032 from CTCGAAGAG to ACCCCACTG 
and at position 1265 from T to A to encode for the 
DNA polymerases having the desired properties. 
The combined effects of these amino acid 
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modif icalSrons reduce the selective discrimination 
against incorporation of fluorescent dye-labeled 
ddCTP and dye -labeled ddATP of the naturally- 
occurring mesophilic bacillus DNA polymerases 
during enzymatic reaction for direct automated 
fluorescent DNA sequencing. 

Initially, the DNA polymerases used in 
the inventors' research were obtained by 
overexpression of the genes encoding the 
naturally-occurring enzymes of Bacillus 
stearothermophilus and Bacillus caldotenax. 
Subsequently, modified DNA polymerases obtained by 
overexpression of the site-directed mutated genes 
were used. This invention provides both the 
nucleotide and amino acid sequence for a modified 
DNA polymerase to illustrate the practice of this 
new approach of modifying a special group of DNA 
polymerases, as described below. 

In one preferred embodiment, the Bst 320 
DNA polymerase is used for the unmodified, 
naturally-occurring DNA polymerase, although DNA 
polymerases isolated from other strains of 
mesophilic bacilli (for instance, Bacillus 
stearothermophilus and Bacillus caldotenax) can be 
used as the starting enzymes for the genetic 
modification. As noted above, the Bst 32 0 DNA 
polymerase is also capable of proofreading 3' -5' 
exonuclease activity. In particular, the 
invention provides the DNA and amino acid 
sequences for the isolated and purified DNA 
polymerase having this function. These sequences 
are also described below. 

The invention also contemplates an 
isolated strain of Bacillus stearothermophilus 
which produces a DNA polymerase having an ability 
to reduce selective discrimination against 
incorporation of fluorescent dye-labeled 



eotide terminators ddCTP ana 



ili 



dideoxynuS?eotide terminators ddCTP ana ddATP, but 
not fluorescent dye- labeled dideoxynucleotide 
terminators ddGTP and ddTTP, in the presence of 
dNTPs and the four fluorescent dye-labeled 
5 dideoxynucleotide terminators. Preferably, the Bst 

strain produces a DNA polymerase which also has 
proofreading 3' -5' exonuclease activity during DNA 
sequencing of a DNA strand from a template. 

As mentioned above, the invention also 

10 contemplates DNA polymerases obtained or otherwise 

derived from any bacillus strain, or made 
synthetically, as long as the amino acid sequences 
of the naturally- occurring DNA polymerases have 
leucine -glut amate-glutamate at positions 

15 corresponding respectively to positions 342-344 of 

Bst 320 DNA polymerase and phenylalanine at a 
position corresponding to position 422 of Bst 320 
DNA polymerase. For example, DNA polymerases 
derived from other strains of Bacillus 

2 0 stearothexmophilus or Bacillus caldotenax or other 

mesophilic bacilli may be easily modified using 
conventional DNA modification techniques to 
include the amino acid or nucleotide substitutions 
identified above . 
25 The invention also provides a DNA 

construct comprising at least one of the above- 
described DNA polymerase sequences and a vector 
(such as a cloning vector or an expression 
vector) , for introducing the DNA construct into 

3 0 eucaryotic or procaryotic host cells (such as an 

E. coli host cell) . In addition, the invention 
further provides a host cell stably transformed 
with the DNA construct in a manner allowing 
production of the peptide encoded by the DNA 
35 segment in the construct. 

The invention also provides improved 
methods for replicating DNA and sequencing DNA 
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using th^roove -described DNA polymer^PIs of the 
invention. The DNA polymerases are useful in both 
direct dye terminator DNA sequencing and dye- 
primer DNA sequencing . 

Preferably, the method of sequencing a 
DNA strand may comprise the steps of: 

i) hybridizing a primer to a DNA template to 
be sequenced; 

ii) extending the primer using a DNA 
polymerase which has an ability to reduce 
selective discrimination against incorporation of 
fluorescent dye -labeled dideoxynucleotide 
terminators ddCTP and ddATP, in the presence of 
adequate amounts of nucleotide bases dATP, dGTP, 
dCTP and dTTP, or their analogs, and the four 
fluorescent dye-labeled dideoxynucleotide 
terminators, 

under such conditions that the DNA strand is 
sequenced. 

Further objects and advantages of the 
invention will become apparent from the 
description and examples below. 



BRIEF DESCRIPTION OF THE DRAWINGS 
In the Figures and throughout this 
disclosure, "HiFi Bst" or "Bst 320" DNA polymerase 
refers to the unmodified naturally occurring DNA 
polymerase having proofreading 3' -5' exonuclease 
activity, either isolated from the cells of No. 
32 0 strain of Bacillus stearotheiwophilus or 
produced by overexpression of the gene encoding 
this naturally occurring DNA polymerase. (This Bst 
strain and DNA polymerase are described in U.S. 
Patent 5,747,298.) "HiFi Bst-II" refers to the 
modified form of "HiFi Bst" DNA polymerase which 
has an ability to reduce selective discrimination 




against 'Wiorescent dye -labeled ddCT^PTnd ddATP. 
HiFi Bst-II is an example of one preferred 
embodiment of this invention. 

Figure 1. This graph shows the 
thermostability at 65°C of HiFI Bst-II and HiFI 
Bst . 

Y: relative polymerase activity (%) 
X: incubation time (minutes) . 
Figure 2 . This shows a autoradiograph of 
a DNA sequencing gel obtained by using 
radiolabeled primer with HiFi Bst-II and HiFi Bst, 
and shows the dideoxy-nucleotide incorporation of 
HiFi Bst-II and HiFi Bst in a reaction mixture 
with a suboptimally low ddNTP/dNTP ratios. 

Template: single- stranded M13mpl8; 
Primer: -20M13 forward primer. 
Figure 3 . This shows a autoradiograph of 
a DNA sequencing gel obtained by using 
radiolabeled dATP with HiFi Bst and HiFi Bst-II in 
reaction mixtures with optimized ddNTP/dNTP 
ratios. The sequence pattern with HiFi Bst-II is 
better than that with HiFi Bst. 

Template: single-stranded M13mpl8; 
Primer: -20M13 forward primer. 
Figure 4. This shows the results of dye- 
primer DNA sequencing with HiFi Bst 

Template: single-stranded pGEM-3Zf(+); 
Primer: -21M13 forward DYEnamic 
Energy Transfer Dye Primers. 

Figure 5. This shows the results of dye- 
primer DNA sequencing with HiFi Bst-II. 

Template: single- stranded M13mpl8; 
Primer: -21M13 forward DYEnamic 
Energy Transfer Dye Primers. 

Figure 6. This shows the results of dye- 
terminator DNA sequencing with HiFi Bst 

Template: single-stranded pGEM- 



3Zf( + ); 

Primer: -20M13 forward primer. 
Figure 7. This shows the results of dye- 
terminator DNA sequencing with HiFi Bst-II. 

Template: single-stranded M13mpl8; 
Primer: -20M13 forward primer. 
Figure 8. Like Figure 6, this shows the 
results of four fluorescent dye-labeled 
terminators DNA sequencing with HiFi Bst. In 
Figure 8 corrections of the missing or ambiguous 
bases, according to the known pGEM sequence, are 
indicated below the letters "N" or below the 
incorrect base letters. 

Template: single-stranded pGEM- 

3Zf ( + ) ; 

Primer: -20M13 forward primer. 

DETAILED DESCRIPTION OF THE INVENTION 

The DNA polymerases of the invention are 
capable of reducing selective discrimination 
against incorporation of fluorescent dye-labeled 
dideoxynucleotide terminators ddCTP and ddATP, 
(but not ddGTP and ddTTP) , in the presence of 
adequate amounts of dNTPs and the four 
terminators . 

The inventors discovered that certain 
modifications of the amino acid sequence of DNA 
polymerases (i.e., modifying the amino acids at 
positions 342-344 to substitute threonine, proline 
and leucine, respectively, for leucine, glutamate 
and glutamate, and modifying the amino acid at 
position 422 to substitute tyrosine for 
phenylalanine, as corresponding to the amino acid 
sequence of Bst 320 DNA polymerase) result in a 
marked reduction of the innate selective 
discrimination against incorporation of 



fluorescent dye-labeled dideoxynucleotide ddCTP 
and ddATP, that is characteristic of the 
unmodified DNA polymerase. This reduction of 
selective discrimination can be demonstrated by 
direct automated fluorescent terminator DNA 
sequencing as recovered missing or ambiguous "C" 
peaks and/or "A" peaks in the automated sequencing 
results, using a known template as standard. 

The preferred source for the DNA 
polymerase is Bacillus stearothermophilus because 
DNA polymerase isolated from this mesophilic 
bacillus is highly processive, can be used and 
stored in dried down form, has an optimum 
temperature at 65°C, and can be used for direct 
DNA sequencing without precycling. The preferred 
Bst DNA polymerase is that isolated from strain 
320 with an amino acid sequence as follows: 

Amino acid sequence (SEQ ID NO: 2) : 
AEGEKPLEEM EFAIVDVITE EMLADKAALV VEVMEENYHD 
APIVGIALVN EHGRFFMRPE TALADSQFLA WLADETKKKS 
MFDAKRAWA LKWKGIELRG VAFDLLLAAY LLNPAQDAGD 
IAAVAKMKQY EAVRSDEAVY GKGVKRSLPD EQTLAEHLVR 
KAAAIWALEQ PFMDDLRNNE QDQLLTKLEH ALAAILAEME 
FTGVNVDTKR LEQMGSELAE QLRAIEQRIY ELAGQEFNIN 
SPKQLGVILF EKLQLPVLKK TKTGYSTSAD VLEKLAPHHE 
IVENILHYRQ LGKLQSTYIE GLLKWRPDT GKVHTMFNQA 
LTQTGRLSSA EPNLQNIPIR LEEGRKIRQA FVPSEPDWLI 
FAADYSQIEL RVLAHIADDD NLIEAFQRDL DIHTKTAMDI 
FQLSEEEVTA NMRRQAKAVN FGIVYGISDY GLAQNLNITR 
KEAAEFIERY FASFPGVKQY MENIVQEAKQ KGYVTTLLHR 
RRYLPDITSR NFNVRSFAER TAMNTPIQGS AADIIKKAMI 
DLAARLKEEQ LQARLLLQVH DELILEAPKE EIERLCELVP 
EVMEQAVTLR VPLKVDYHYG PTWYDAK 



The characters represent the following amino 
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acids : 
where , 

A: alanine (Ala) M: methionine (Met) 

C: cysteine (Cys) N: asparagine (Asn) 

5 D: aspartic acid (Asp) p : proline (Pro) 

E: glutamic acid (Glu) Q : glutamine (Gin) 

F: phenylanaline (Phe) R : arginine (Arg) 

G: glycine (Gly) S: ser ine (Ser) 

H: histidine (His) T: threonine (Thr) 

10 I: isoleucine (He) V: valine (Val) 

K: lysine (Lys) W: tryptophan (Trp) 

L: leucine (Leu) Y : tyrosine (Tyr) 

The Bst 320 DNA polymerase is 
characterized by possessing a proofreading 3' -5' 
15 exonuclease activity. 

The nucleotide sequence encoding the 
unmodified Bst 320 DNA polymerase is indicated in 
SEQ ID NO:l, in Example 2 below. 

The following amino acid sequence 
represents the modified Bst 320 DNA polymerase as 
the preferred embodiment of this invention, 
modified from the naturally-occurring Bst 320 DNA 
polymerase at positions 342-344 to substitute 
threonine, proline and leucine, respectively, for 
leucine, glutamate and glutamate, and at position 
422 to substitute tyrosine for phenylalanine. 

Amino acid sequence (SEQ ID: No 4) : 
MAEGEKPLEEME F AI VDVI TEEMLADKAALWEVMEENYHDAP I V- 
GIALVNEHGRFFMRPETALADSQFLAWLADETKKKSMFDAKRAW 
ALKWKG I ELRGVAFDLLLAAYLLNPAQDAGD I AA VAKMKQYEAVR 
SDEAVYGKGVKRS LPDEQTLAEHLVRKAAAI WALEQP FMDDLRNN 
EQDQLLTKLEHALAAILAEMEFTGVNVDTKRLEQMGSELAEQLRA 
IEQRIYELAGQEFNINSPKQLGVILFEKLQLPVLKKTKTGYSTSA 
DVLEKLAPHHEIVENILHYRQLGKLQSTYIEGLLKWRPDTGKVH. 
TMFNQALTQTGRLS SAEPNLQNI P IRTPLGRKI RQAFVPSEPDWL 

IFAADYSQIELRVLAHIADDDNLIEAFQRDLDIHTKTAMDIFQLS 
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EEE 1 




.rqakavnygivygisdyglaqn: 




'RKEAAEFIER 



YFASFPGVKQYMENIVQEAKQKGYVTTLLHRRRYLPDITSRNFNV 
RS FAERTAMNT P I QGSAAD 1 1 KKAM I DLAARLKEEQLQARLLLQV 
HDEL I LEAPKEE I ERLCELVPEVMEQAVTLRVPLKVDYHYGPTWY 
DAK 

The underlined amino acids are substituted amino 
acids produced by site-directed mutation of the 
naturally- occurring Bst 320 DNA polymerase. 



The modified Bst 320 DNA polymerase is 



encoded by a DNA sequence such as the following 
(SEQ ID NO: 3) : 



GCCGAAGGGG AGAAACCGCT TGAGGAGATG GAGTTTGCCA 
TCGTTGACGT CATTACCGAA GAGATGCTTG CCGACAAGGC 
AGCGCTTGTC GTTGAGGTGA TGGAAGAAAA CTACCACGAT 
GCCCCGATTG TCGGAATCGC ACTAGTGAAC GAGCATGGGC 
GATTTTTTAT GCGCCCGGAG ACCGCGCTGG CTGATTCGCA 
ATTTTTAGCA TGGCTTGCCG ATGAAACGAA GAAAAAAAGC 
ATGTTTGACG CCAAGCGGGC AGTCGTTGCC TTAAAGTGGA 
AAGGAATTGA GCTTCGCGGC GTCGCCTTTG ATTTATTGCT 
CGCTGCCTAT TTGCTCAATC CGGCTCAAGA TGCCGGCGAT 
ATCGCTGCGG TGGCGAAAAT GAAACAATAT GAAGCGGTGC 
GGTCGGATGA AGCGGTCTAT GGCAAAGGCG TCAAGCGGTC 
GCTGCCGGAC GAACAGACGC TTGCTGAGCA TCTCGTTCGC 
AAAGCGGCAG CCATTTGGGC GCTTGAGCAG CCGTTTATGG 
ACGATTTGCG GAACAACGAA CAAGATCAAT TATTAACGAA 
GCTTGAGCAC GCGCTGGCGG CGATTTTGGC TGAAATGGAA 
TTCACTGGGG TGAACGTGGA TACAAAGCGG CTTGAACAGA 
TGGGTTCGGA GCTCGCCGAA CAACTGCGTG CCATCGAGCA 
GCGCATTTAC GAGCTAGCCG . GCCAAGAGTT CAACATTAAC 
TCACCAAAAC AGCTCGGAGT CATTTTATTT GAAAAGCTGC 
AGCTACCGGT GCTGAAGAAG ACGAAAACAG GCTATTCGAC 
TTCGGCTGAT GTGCTTGAGA AGCTTGCGCC GCATCATGAA 
ATCGTCGAAA ACATTTTGCA TTACCGCCAG CTTGGCAAAC 
TGCAATCAAC GTATATTGAA GGATTGTTGA AAGTTGTGCG 
CCCTGATACC GGCAAAGTGC ATACGATGTT CAACCAAGCG 



ATG 



CTGACGCAiBr CTGGGCGGCT CAGCTCGGCC GAGCCGAACT 
TGCAAAACAT TCCGATTCGG ACCCCACTGG GGCGGAAAAT 
CCGCCAAGCG TTCGTCCCGT CAGAGCCGGA CTGGCTCATT 
TTCGCCGCCG ATTACTCACA AATTGAATTG CGCGTCCTCG 
5 CCCATATCGC CGATGACGAC AATCTAATTG AAGCGTTCCA 

ACGCGATTTG GATATTCACA CAAAAACGGC GATGGACATT 
TTCCAGTTGA GCGAAGAGGA AGTCACGGCC AACATGCGCC 
GCCAGGCAAA GGCCGTTAAC TACGGTATCG TTTACGGAAT 
TAGCGATTAC GGATTGGCGC AAAACTTGAA CATTACGCGC 

10 AAAGAAGCTG CCGAATTTAT CGAACGTTAC TTCGCCAGCT 

TTCCGGGCGT AAAGCAGTAT ATGGAAAACA TAGTGCAAGA 
AGCGAAACAG AAAGGATATG TGACAACGCT GTTGCATCGG 
CGCCGCTATT TGCCTGATAT TACAAGCCGC AATTTCAACG 
TCCGCAGTTT TGCAGAGCGG ACGGCCATGA ACACGCCAAT 

15 TCAAGGAAGC GCCGCTGACA TTATTAAAAA AGCGATGATT 

GATTTAGCGG CACGGCTGAA AGAAGAGCAG CTTCAGGCTC 
GTCTTTTGCT GCAAGTGCAT GACGAGCTCA TTTTGGAAGC 
GCCAAAAGAG GAAATTGAGC GATTATGTGA GCTTGTTCCG 
GAAGTGATGG AGCAGGCCGT TACGCTCCGC GTGCCGCTGA 

20 AAGTCGACTA CCATTACGGC CCAACATGGT ATGATGCCAA 

A 

The characters represent the following 
nucleotides : 

A: Adenosine T: Thymidine 

25 C: Cytidine G: Guanosine 

The underlined nucleotides are 
substituted nucleotides produced by site-directed 
mutation of the naturally- occurring Bst 320 
polymerase . (As would be apparent to someone 

30 skilled in this art, this DNA sequence does not 

indicate the starting codon.) 

The invention also contemplates any DNA 
sequence that is complementary to the modified Bst 
320 DNA sequence, for instance, DNA sequences that 

35 would hybridize to the above DNA sequence of the 

modified DNA polymerase under stringent 
conditions. As would be understood by someone 
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skilled nx the art, the invention also 
contemplates any DNA sequence that encodes a 
peptide having these characteristics and 
properties (including degenerate DNA code) . 

This invention also contemplates allelic 
variations and mutations (for instance, adding or 
deleting nucleotide or amino acids, sequence 
recombination or replacement or alteration) which 
result in no substantive change in the function of 
the DNA polymerase or its characteristics. For 
instance, the DNA polymerases encompass non- 
critical substitutions of nucleotides or amino 
acids that would not change functionality (i.e., 
such as those changes caused by a transformant 
host cell) . In addition, the invention is 
intended to include fusion proteins and muteins of 
the unique DNA polymerases of this invention. 

The DNA sequences and amino acid 
sequences for the modified DNA polymerase of this 
invention are also obtainable by, for instance, 
isolating and purifying DNA polymerase from a 
Bacillus stearothermophilus , or a bacterial strain 
otherwise derived from Bacillus 

stearothermophilus , or other mesophilic bacillus 
strains such as Bacillus caldotenax or Bacillus 
caldolyticus. The DNA polymerases obtained from 
these organisms may be easily modified using 
conventional DNA modification techniques to 
achieve the reduction in fluorescent dye -labeled 
ddCTP and ddATP selective discrimination, as long 
as the unmodified amino acid sequences have 
leucine-glutamate-glutamate at positions 
corresponding respectively to positions 342-344 of 
Bst 320 DNA polymerase and phenylalanine at a 
position corresponding to position 422 of Bst 32 0 
DNA polymerase. For instance, using the primers 
and methods of screening described herein, someone 



skilled 



the art could isolate a DNA polymerase 




having the same properties and function from other 
strains . 



conventional DNA sequencing protocols, it is 
preferred that the enzymes have low or no 
exonuclease activity. However, in this invention, 
it is preferred that the DNA polymerases have a 
function of high fidelity ( u HiFi") nucleotide 
incorporation. Therefore, in one preferred 
embodiment the invention entails modification of a 
naturally-occurring Bst DNA polymerase having a 
proofreading 3' -5' exonuclease activity. This 
preferred modified DNA polymerase (e.g., w HiFi 
Bst -I I") has a nucleotide sequence indicated in 
SEQ ID:NO 3 and an amino sequence indicated in SEQ 
ID:NO 4. To initially obtain a Bst DNA polymerase 
having proofreading 3' -5' activity, strains of 
Bacillus stearothermophilus can be segregated into 
different groups according to the proof-reading 
exonuclease activity of their respective DNA 
polymerases . 



construct comprising at least one of the DNA 
sequences of the modified DNA polymerase and a 
vector (such as a cloning vector or an expression 
vector) , for introducing the DNA construct into 
host cells. An example of a suitable vector is 
pYZ34/LF, described below. 



being stably transformed with the DNA construct in 
a manner allowing production of the peptide 
encoded by the DNA segment in the construct 
(preferably in large quantity) . The host cells 
may be of eucaryotic or procaryotic origin (such 
as a E^ coli host cell) . For instance, the host 
cell may be a mesophilic organism, although this 



In the DNA polymerases currently used in 



The invention also provides a DNA 



The \host cells need only be capable of 





is not accessary requirement in order that a 
host cell be effective. 



methods for DNA sequencing using the above - 
described novel DNA polymerases. The methods 
entail sequencing a DNA strand by conventional 
protocols with the following modifications: 

i) hybridizing a primer to a DNA template to 
be sequenced; 

ii) extending the primer using a DNA 
polymerase described above, in the presence of 
radiolabeled dATP, nucleotides dGTP, dCTP and 
dTTP, or their analogs, and ddNTP chain 
terminators; and 

iii) allowing a DNA strand to be sequenced. 



incorporated equally effectively in the chain 
elongation during sequencing reaction catalyzed by 
the DNA polymerases of the invention with a high 
processivity and a high elongating rate. 



concentrations of modified Bst DNA polymerase used 
in radiolabeled DNA sequencing are as following: - 
A mix: dATP 0.8//M, dCTP 80/xM, dGTP 80/iM, dTTP 
80/zM, ddATP 25/xM; 

C mix: dATP 0.8//M, dCTP 8/xM, dGTP 80/iM, dTTP 
80/xM, ddCTP 20/xM; 

G mix: dATP 0.8/iM, dCTP 80/zM, dGTP 8/zM, dTTP 
80/iM, ddGTP SO/xM; 

T mix: dATP 0. N 8/xM, dCTP 80/xM, dGTP 80/zM, dTTP 
8/xM, ddTTP 50/xM. (This mixture is useful for the 
particular modified Bst 320 DNA polymerase set 
forth above, as well as for other modified Bst DNA 
polymerases . ) 



other uses of the modified DNA polymerases. For 
instance, the DNA polymerase can also be use in 



The invention also provides improved 



All four dNTPs, including dCTP, are 



Preferably the nucleotide premix 



In addition, the invention contemplates 



(1) fillmg-in 5' overhangs of DNA fragments; (2) 
synthesis of DNA probes by random primers labeling 
methodology; and (3) site-directed mutagenesis. 



The following non- limiting examples are 
illustrative of the invention. 

Example 1 : Screening for Bst polymerases 

This invention also involves a method to 
measure the proof-reading 3' -5' exonuclease 
activity of purified DNA polymerases. The method 
is useful to screen a large number of bacterial 
strains, such as Bacillus stearothermophilus and 
other mesophilic bacterial strains, to select a 
strain which produces a DNA polymerase with a high 
proof-reading 3' -5' exonuclease activity. For 
instance, the method to test the proof-reading 3'- 
5' exonuclease activity of DNA polymerase was 
carried out as follows. 

A DNA primer and two DNA templates with 
following sequences were synthesized chemically, 
using a DNA synthesizer. 

17-base primer 5' CATTTTGCTGCCGGTCA 3' 

1 mg/ml 
(SEQ ID NO: 5) 

Template (a) 3 ' GTAAAACGACGGCCAGTCTT 5 ' 

\ 10 mg/ml 

(SEQ ID NO: 6) 

Template (b) 3 ' GTAAAACGACGGCCAGTCGG 5 ' 

10 mg/ml 
(SEQ ID NO: 7) 
To produce the radiolabeled primer, 1 fil 
(1 /xg) of primer, 5 fil (50 /xg) of template (a) , 1 
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111 of [a^P] dATP (800 Ci/mmole) , 1 /iTof dGTP 
(0.5 mM) , 1 fil of Taq DNA polymerase (1 unit), and 
1 fil of buffer consisting of 500 mM Tris-Cl, pH 
9.0, and 150 mM MgCl 2/ were mixed in a test tube 
5 and incubated in a 65 *C water bath for 5 minutes. 

The mixture was subject to alkaline denaturing gel 
electrophoresis. The radioactive band containing 
the 20-base nucleotide was isolated and dissolved 
in 12 fil of 10 mM Tris-Cl buffer, containing 1 mM 
10 EDTA, pH 8.0. The final product represents the 

following labeled 20-base primer. 

5 ' CATTTTGCTGCCGGTCAGA*A* 3 ' 
(* = 32 P labeled) 
J (SEQ ID N0:8) 

lH 15 To produce radiolabeled primer- template 

\Z complexes, 5 ill of the labeled primer was mixed 

p with 10 ill of template (a) or template (b) 

=^T respectively to form the following: 

Si 

P Complex (a) 

If] 20 5 ' CATTTTGCTGCCGGTCAGA*A* 3' (same as SEQ ID NO: 8) 

: p 3 ' GTAAAACGACGGCCAGTCT T 5' (same as SEQ ID NO: 6) 



Complex (b) 

5' CATTTTGCTGCCGGTCAGA*A* 3' (same as SEQ ID NO: 8) 
3' GTAAAACGACGGCCAGTCG G 5' (same as SEQ ID NO: 7)" 

25 The free radiolabeled primer was removed through a 

G - 5 0 S ephadex \col umn . 

An aliquot of complex (a) which had two 
correctly matched radiolabeled A*s at the 3' 
terminus of the primer, and an aliquot of complex 

30 (b) which had two mismatched A*s at the 3' 

terminus of the primer, were then pipetted into 
two individual vials of scintillation fluid and 
their radioactivity was measured in a 
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scintillator! counter, and both complexes were 
adjusted with buffer to a concentration containing 
the same molarity of incorporated [a- 32 P] dAMP. 

To perform the proof-reading 3' -5' 
exonuclease activity, 20 pi of complex (a) or 
complex (b) , 8 fxl reaction buffer consisting of 15 
mM Tris-Cl and 15 mM MgCl 2/ pH 8.5, 4 units of DNA 
polymerase, and enough water to make up a total 
volume of 40 //l were pipetted into a test tube and 
mixed well. The mixture was subdivided into 
aliquots of 3 /zl each in 0.5 ml microcentrifuge 
tubes and was then covered with 3 /zl paraffin in 
each tube. The microcentrifuge tubes were 
incubated in a 65 °C water bath. At 1, 2, 3, 5, 10, 
and 20 minutes, a pair of the microcentrifuge 
tubes were taken out from the water bath and the 
content of each tube was dotted onto a DE-81 
Whatman filter paper. One of each pair of the 
filter papers was put in scintillation fluid 
directly and the radioactivity was counted in cpm 
value in a scintillation counter; the other was 
washed three times in 0.3 M sodium phosphate 
buffer, pH 6.8 before being put into the 
scintillation fluid for counting. 

The difference in radioactivity 
expressed in cpm value between the washed filter 
paper and the unwashed filter paper in each pair 
was interpreted as representing the relative 
quantity of labeled nucleotides excised by the 3'- 
5' exonuclease, activity from the 3' terminus of 
the primer. A DNA polymerase that excised the 
radiolabeled nucleotides A*s from complex (b) more 
efficiently than from complex (a) possessed proof- 
reading 3' -5 'exonuclease activity. A DNA 
polymerase that excised the radiolabeled 
nucleotides A*s from complex (a) faster than from 
complex (b) , or at nearly the same rate, was 



interpi^ffed as possessing a non-speOTic 3' -5' 
exonuclease activity which is considered 
unsuitable for DNA sequencing. 

Using these methods, a strain of 
bacteria was isolated from among the strains of 
Bacillus stearothermophilus from various sources 
which is distinguished in its fast growth rate. 
This strain reached an optimum exponential growth 
within 3 hours for DNA polymerase production. The 
strain was also able to produce a DNA polymerase 
with a proof-reading 3' -5' exonuclease activity. 
This strain of Bacillus stearothermophilus was 
labeled Bst No. 320. 

As persons skilled in this art would 
appreciate, the bacterial strain, or even the 
strain of Bacillus stearothermophilus , from which 
a mesophilic DNA polymerase of the invention can 
be obtained may be derived using the above - 
described methods or others known in the art from 
strains of Bacillus stearothermophilus or other 
bacteria strains (especially mesophilic strains) , 
including wild strains or mutant strains acquired 
by various means, including spontaneous mutation. 

To prepare the preferred purified Bst 
DNA polymerase, the cells of Bst No. 320 were 
grown at 55 °C in a liquid medium consisting of 1% 
polypeptone, 0.5% yeast extract and 0.5% NaCl, 
pH7.0-7.2. The 3 hr old cells were collected after 
centrifugation and suspended in 4 volumes of TME 
buffer (50mM Tris-HCl, pH7.5, lOmM /?- 
mercaptoethanol , and 2 mM EDTA) , containing 100 mg 
lysozyme and 23 mg pheny Ime thy 1 sulphonyl - 
fluoride/ml. The cells were broken by sonication 
in ice. The supernatant was pooled after 
centrifugation at 28,000 rpm in a Spinco L 30 
rotor. 

The purified Bst DNA polymerase of the 



inventi^Pwas prepared according to ^lazaki and 

Kornberg (7) with appropriate slight modifications 
and the large fragment of the DNA polymerase was 
obtained by partial digestion of the whole DNA 
polymerase with the proteinase subtilisin (type 
Carlsberg) basically according to Jacobsen et al . 
(8) . 

The procedure for purification of enzyme 
was followed as described in Ye and Hong (4) . 
This Bst DNA polymerase possessed a proof-reading 
3' -5' exonuclease activity. 

The Bst polymerase was tested for proof- 
reading and non-specific 3 ' -5 ' exonuclease 
activities as described above. The results showed 
that the polymerase excised the mismatched 
incorporated nucleotides from the 3' terminus of a 
double -stranded DNA at a high rate, reaching the 
plateau of hydrolysis in about 3 minutes, about 8 
times more efficiently in the first 3 minutes of 
reaction than those correctly matched with the 
nucleotides of the template. This enzyme is 
referred to herein as HiFi Bst DNA polymerase, and 
is distinguishable from Bst DNA polymerases 
isolated from other strains of Bacillus 
s t ea.ro th ermophi 1 u s . 

This prodedure of using HiFi Bst as the 
DNA polymerase in the classic radiolabeling Sanger 
reaction for DNA sequencing and its autoradiograph 
are illustrated in Example 8. The data obtained 
by adapting this procedure to use HiFi Bst as the 
DNA polymerase in dye- labeled primer automated 
fluorescent DNA sequencing are illustrated in 
Example 9. These results indicate that HiFi Bst 
DNA polymerase can be used for the classic Sanger 
manual sequencing and the fluorescent dye- labeled 
primer sequencing with high processivity and high 
fidelity. 



owever, when the protocol Wfs modified 
for fluorescent dye- labeled ddNTP terminator 
automated DNA sequencing, numerous U C" peaks and 
U A" peaks were missing or appeared ambiguous in 
the sequence plot, especially when the "C" peak 
was immediately after a n G" peak or after an "A" 
peak, and when the "A" peak was immediately after 
a W G" peak. This selective discrimination against 
incorporation of dye-labeled ddCTP and dye-labeled 
ddATP is sequence-related since many U C" and U A" 
peaks following an immediate "C" peak or an 
immeciate W T" peak remained strong and correctly 
resolved in the same color plot of sequence 
analysis. (See Example 9) This phenomenon of 
selective discrimination against incorporation of 
fluorescent dye -labeled ddCTP and dye -labeled 
ddATP is observed with all DNA polymerases 
isolated from different strains of Bacillus 
stearothexmophilus and Bacillus caldotenax, and 
appears to be characteristic of DNA polymerases of 
the mesophilic bacilli. 



Example 2: Mutation of the gene for naturally- 
occurring Bst DNA polymerase having proofreading 
3' -5' exonuclease activity 

The DNA fragment LF containing the gene 
initially isolated from the wild Bst 320 has the 
following sequence (see SEQ ID N0:1) : 

DNA sequence (isolated/purified) : 

GCCGAAGGGG AGAAACCGCT TGAGGAGATG GAGTTTGCCA 

TCGTTGACGT CATTACCGAA GAGATGCTTG CCGACAAGGC 

AGCGCTTGTC GTTGAGGTGA TGGAAGAAAA CTACCACGAT 

GCCCCGATTG TCGGAATCGC ACTAGTGAAC GAGCATGGGC 

GATTTTTTAT GCGCCCGGAG ACCGCGCTGG CTGATTCGCA 

ATTTTTAGCA TGGCTTGCCG ATGAAACGAA GAAAAAAAGC 



ATGTTTGA^^ CCAAGCGGGC AGTCGTTGCC ^^AGTGGA 
AAGGAATTGA GCTTCGCGGC GTCGCCTTTG ATTTATTGCT 
CGCTGCCTAT TTGCTCAATC CGGCTCAAGA TGCCGGCGAT 
ATCGCTGCGG TGGCGAAAAT GAAACAATAT GAAGCGGTGC 
GGTCGGATGA AGCGGTCTAT GGCAAAGGCG TCAAGCGGTC 
GCTGCCGGAC GAACAGACGC TTGCTGAGCA TCTCGTTCGC 
AAAGCGGCAG CCATTTGGGC GCTTGAGCAG CCGTTTATGG 
ACGATTTGCG GAACAACGAA CAAGATCAAT TATTAACGAA 
GCTTGAGCAC GCGCTGGCGG CGATTTTGGC TGAAATGGAA 
TTCACTGGGG TGAACGTGGA TACAAAGCGG CTTGAACAGA 
TGGGTTCGGA GCTCGCCGAA CAACTGCGTG CCATCGAGCA 
GCGCATTTAC GAGCTAGCCG GCCAAGAGTT CAACATTAAC 
TCACCAAAAC AGCTCGGAGT CATTTTATTT GAAAAGCTGC 
AGCTACCGGT GCTGAAGAAG ACGAAAACAG GCTATTCGAC 
TTCGGCTGAT GTGCTTGAGA AGCTTGCGCC GCATCATGAA 
ATCGTCGAAA ACATTTTGCA TTACCGCCAG CTTGGCAAAC 
TGCAATCAAC GTATATTGAA GGATTGTTGA AAGTTGTGCG 
CCCTGATACC GGCAAAGTGC ATACGATGTT CAACCAAGCG 
CTGACGCAAA CTGGGCGGCT CAGCTCGGCC GAGCCGAACT 
TGCAAAACAT TCCGATTCGG CTCGAAGAGG GGCGGAAAAT 
CCGCCAAGCG TTCGTCCCGT CAGAGCCGGA CTGGCTCATT 
TTCGCCGCCG ATTACTCACA AATTGAATTG CGCGTCCTCG 
CCCATATCGC CGATGACGAC AATCTAATTG AAGCGTTCCA 
ACGCGATTTG GATATTCACA CAAAAACGGC GATGGACATT 
TTCCAGTTGA GCGAAGAGGA AGTCACGGCC AACATGCGCC 
GCCAGGCAAA GGCCGTTAAC TTCGGTATCG TTTACGGAAT 
TAGCGATTAC GGATTGGCGC AAAACTTGAA CATTACGCGC 
AAAGAAGCTG CCGAATTTAT CGAACGTTAC TTCGCCAGCT 
TTCCGGGCGT AAAGCAGTAT ATGGAAAACA TAGTGCAAGA 
AGCGAAACAG AAAGGATATG TGACAACGCT GTTGCATCGG 
CGCCGCTATT TGCCTGATAT TACAAGCCGC AATTTCAACG 
TCCGCAGTTT TGCAGAGCGG ACGGCCATGA ACACGCCAAT 
TCAAGGAAGC GCCGCTGACA TTATTAAAAA AGCGATGATT 
GATTTAGCGG CACGGCTGAA AGAAGAGCAG CTTCAGGCTC 
GTCTTTTGCT GCAAGTGCAT GACGAGCTCA TTTTGGAAGC 
GCCAAAAGAG GAAATTGAGC GATTATGTGA GCTTGTTCCG 
GAAGTGATGG AGCAGGCCGT TACGCTCCGC GTGCCGCTGA 
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AAGTCGACSF CCATTACGGC CCAACATGGT ATCATGCCAA 
ATAA (1764 nucleotides total) 



Site directed mutagenesis was performed 
as described by Kunkel et al. (14) The DNA fragment 
(designated "LF") containing the gene for Bst DNA 
polymerase having proofreading exonuclease 
activity was cloned from the expression vector 
pYZ23/LF into plasmid pUC119. The constructed 
plasmid pUC119/LF was then transformed into E. 
coli CJ236, a mutant of E. coli that lacks the 
enzymes dUTPase and uracil N-glycosylase . 
Therefore, when grown in a medium supplemented 
with uridine, this mutant of E. coli as well as 
the plasmids in the cells will incorporate 
deoxyuridine into the DNA in place of thymidine 
and the uracils will not be removed readily. 

As the constructed plasmid grew in the 
cells of E. coli CJ23 6 and in the presence of 
uracil and M13K07 helper phage, the normal 
thymidine bases of the DNA in the newly produced 
single-stranded pUC119/LF were replaced by 
uracils. These uracil-containing DNAs were used 
as the template in vitro for the production of a 
complementary oligonucleotide that contained the 
desired DNA sequence alteration, but with only 
TMPs and not dUMP residues. 

In practice, the expression vector 
pYZ23/LF was digested with restriction enzymes Eco 
RI and Bam HI , and the DNA fragment LF was 
separated and cloned into plasmid pUC119 which had 
been previously digested with the same restriction 
enzymes. The constructed plasmid pUC119/LF was 
then transformed into E^ coli CJ236. For gaining 
the uracil-containing single-stranded pUC119/LF, a 
colony of E_^ coli CJ236 containing pUC119/LF was 
selected and inoculated into 2 ml of 2 x YT medium 



which wa^gupplemented with 0.25 ug/m^^>f uridine 
and 2 x 10 8 to 4 x 10 8 pfu/ml of M13K07 as helper 
phage. After incubation at 3 7°C with strong 
agitation for 1 hour, a kanamycin solution (25 
mg/ml in H 2 0) was added to the culture to a final 
concentration of 70 ug/ml . The incubation was 
allowed to continue for another 14-18 hours at 
37°C with strong agitation. Then 1.5 ml of the 
infected culture was transferred to a 
microcentrifuge tube, and centrifuged at 12,000 x 
g for 5 minutes at 4°C. The uracil-containing 
single-stranded pUC119/LF was precipitated and 
purified from the supernatant according to 
standard PEG/NaCl and ethanol procedures. 

After performing a series of 
experiments, the inventors found that the combined 
effects of changing the amino acids leucine- 
glutamate-glutamate (LEE) at the location 342-344, 
to respectively threonine-proline- leucine (TPL) , 
and the amino acid phenylalanine (F) at location 
422, to tyrosine (Y) in the peptide structure of 
HiFi Bst DNA polymerase markedly reduced its 
selective discrimination against incorporation of 
fluorescent dye-labeled ddCTP and dye-labeled 
ddATP to such a level that direct automated 
fluorescent DNA sequence (although not cycle- 
sequencing) can be performed with the dye- 
terminator technology when the mutated enzyme of 
the current invention is used. It is of interest 
to note that bhis modified HiFi Bst, now referred 
to as HiFi Bst-II DNA polymerase, exhibits the 
function of preferentially incorporating more 
fluorescent dye -labeled ddCTP and dye -labeled 
ddATP onto the 3' end dGMP and the dAMP bases of 
the extending DNA strands during enzymatic 
reaction, than the unmodified naturally occurring 
HiFi Bst polymerase. 
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n ^"rhe end result is the recovery of the 

W C" and U A" peaks which otherwise would have been 
missing or ambiguous on the sequence analysis 
color plot. At the same time, the modified enzyme 
did not indiscriminately generate an excess amount 
of dye -labeled U G" terminated or dye -labeled "T" 
terminated DNA fragments. Even the U C" and "A" 
peaks were not uniformly raised in a blanket 
manner, but only raised in the formerly depressed 
locations after a "G" and/or an U A" . (See Example 
9) . Thus, this genetic modification of the HiFi 
Bst to HiFi Bst-II results in a DNA polymerase 
that reduces the selective discrimination against 
incorporation of the fluorescent dye -labeled ddCTP 
and dye-labeled ddATP, rather than merely 
increases the ability of the parent enzyme to 
incorporate these dye-labeled dideoxynucleotides . 

HiFi Bst-II, and the other novel similar 
DNA polymerases of this invention, can be used for 
the classic radiolabeling Sanger method. (See 
Example 8.) HiFi Bst-II appears to generate a 
better sequencing pattern than HiFi Bst and 
requires less ddNTPs to terminate the extending 
reaction (Figure 3) . For instance, in the 
optimized reaction mixture for the unmodified HiFi 
Bst DNA polymerase, the ddNTP/dNTP ratios in the 
A, C, G and T mix were 40, 6.25, 18.25 and 18.72, 
respectively. In the optimized reaction mixture 
for the modified HiFi Bst-II DNA polymerase, the 
corresponding ddNTP/dNTP ratios in the A, C, G and 
T mix were 40, 2.5, 6.25 and 6.25, respectively. 
Therefore, there was an up- to about three- fold 
reduction in the amount of ddNTPs used after 
genetic modification of the naturally-occurring 
DNA polymerase. 

For the radiolabeling classic Sanger 
method of DNA sequencing, the optimized reaction 



mixtures^Kr either HiFi Bst or HiFi Est- II must 
contain much more ddNTPs than dNTPs to generate a 
ladder of DNA fragments for sequencing analysis 
because the DNA polymerases of the mesophilic 
bacilli tend to incorporate dNTPs more efficiently 
than ddNTPs. The above -described genetic 
modification appears to increase the ability of 
the naturally-occurring enzymes to incorporate 
ddNTP in the presence of a corresponding competing 
dNTP to about three -fold at the concentration 
ratios commonly used for DNA sequencing. However, 
if much higher concentrations of the nucleotides 
were used for the experiment, and the ddNTP /dNTP 
ratio was reduced to a level that is suboptimal 
for DNA sequencing (for instance at a ratio of 
1/3) , the increased ability for incorporating 
ddNTPs after modification of the enzyme could be 
dramatized. (See Example 6, Figure 2) . 

Similar to the results obtained with 
radiolabeling Sanger method, both HiFi Bst and 
HiFi Bst -II can be adapted for fluorescent dye- 
labeled primer automated DNA sequencing and 
produce comparable results without selective 
suppression of any specific fluorescent peaks in 
the sequencing plot (see Example 9) although the 
peaks generated by HiFi Bst -II appear to be more 
even than those by HiFi Bst . 

To change amino acids leucine, glutamic 
acid and glutamic acid (LEE) at positions 342-344 , 
respectively in the Bst polymerase into threonine, 
proline and leucine (TPL) , respectively, Primer 1 
was designed as following (see SEQ ID NO 10) : 
5 ' - CATTCCGATTCGG ACCCCACTG GGGCGGAAAATCCG - 3 

To change amino acid phenylalanine (F) 
at position 422 in the Bst DNA polymerase into 
tyrosine (Y) , Primer 2 was designed as following 
(see SEQ ID NO: 9) : 



5 ' -GCClPfAACTACGGTATCGTTTACGG-3 ' 

After phosphorylation of the 5' ends of 
the oligonucleotides by T4 polynucleotide kinase, 
the two primers designed above were annealed to 
the single- stranded uracil -containing pUC119/LF 
purified from above. In the presence of the usual 
dNTPs (dATP, dCTP, dGTP and dTTP) , T4 DNA 
polymerase was used to synthesize in vitro the 
strands of DNA complementary to the uracil - 
containing pUC119/LF template, and T4 ligase was 
used to ligate the synthesized strands to form a 
complete double -stranded plasmid which was 
composed of one single-stranded, not mutagenic, 
uracil -containing pUC119/LF and one complementary 
single -stranded, mutagenic, thymidine -containing 
DNA fragment that had been altered by primer 1 and 
primer 2 described above. These newly formed 
double -stranded plasmids were then transformed 
into E. coli JM109. The template strand was 
rendered biologically inactive. The transformed 
strain of E. coli JM109 whose plasmids contained 
the mutated DNA, now referred to as pUC119/LF-M, 
was screened out with DNA sequencing of its 
plasmids . 

Example 3: Cloning and expression of the modified 
Bst DNA polymerase having both ability to reduce 
selective ddNTP discrimination and proofreading 
3' -5' exonuclease activity 

The plasmid pUC119/LF-M was prepared 
from the strain of Escherichia coli JM109 
containing the mutated DNA. The mutated DNA 
fragment (LF-M) containing the mutated gene for 
the Bst polymerase was recombined back into the 
expression vector pYZ23. The constructed plasmid 
pYZ23/LF-M was then transformed into Escherichia 



coli JFl^re. The mutation was further confirmed by 
double -stranded dideoxy DNA sequencing of isolated 
plasmid. 

The strain of Escherichia coli JF1125 
containing pYZ23/LF-M was inoculated into LB 
culture containing 100/zg/ml ampicillin, and was 
incubated overnight at 30°C. The overnight culture 
was inoculated into a large volume of fresh 
culture, and was incubated at 3 0°C until the OD 600 
of the culture reached 0.7. The culture was then 
heated at 41°C for 3 hours for induction. The SDS- 
PAGE analysis of the cell extract showed that the 
cloned mutated gene for the modified Bst DNA 
polymerase was overexpressed. 

Example 4 : Isolation and purification of the 
modified Bst DNA polymerase having both ability to 
reduce selective ddNTP discrimination and 
proofreading 3' -5' exonuclease activity 

The expressed cells of Escherichia coli 
JF1125 containing pYZ23/LF-M grown in condition as 
described above were thawed and washed with buffer 
[lOraM Tris-HCl (pH7.5 at room temperature), lOmM j8- 
Mercaptoethanol , 2 mM EDTA, 0.9% NaCl] . The 
pellets were then suspended in buffer [50mM Tris- 
HCl (pH7.5 at room temperature), lOmM jS- 
Mercaptoethanol , 2mM EDTA, 100/xg/ml Lysozyme, 
23/ig/ml PMSF] (4ml/g pellet) . After 20 min at room 
temperature, the mixture was cooled on salt -ice 
and sonicated briefly. to complete lysis. The cell 
extract obtained by centrifugation at 18,000rpm at 
4°C for 2 0 minutes, was then treated step by step 
as follows: 

(A) The cell extract was heated at 60°C for 30 
minutes, and cooled to 4°C, then centrifuged at 
15,000rpm at 4°C for 20 minutes; 



(B) 5% ^^min P was added into supel^ftant to 

0.6%, and mixed quickly for 3 0 minutes, then 
cent r i f uged ; 

(C) The pellet was resuspended in Buffer A[50mM 
Tris-HCl (pH7 . 5 at room temperature), ImM EDTA, ImM 
/?-Mercaptoethanol] containing 800mM NaCl and 5% 
Glycerol at 4°C, and then centrifuged; 

(D) Ammonium sulfate was added into the 

supernatant to 60% saturation at 4°C, and 
mixed for 30 minutes, then centrifuged; 

(E) The ammonium sulfate pellet was resuspended in 
30ml of 60% saturated ammonium sulfate at 4°C, and 
then recentrifuged; 

(F) The pellet was suspended in Buffer A 
containing lOOmM KC1 and dialysed against the same 
buffer for hours at 4°C, then centrifuged. The 
insoluble protein was discard; 

(G) The supernatant was added to pass through a 
DE-52 column. The column was washed, and the peak 
DNA polymerase was eluted using a 100-600 mM KCl 
linear gradient in Buffer A, concentrated in 
Buffer A containing 50%(w/v) PEG-6000, dialyzed in 
Buffer A containing 100 mM KCl ; 

(H) The solution was then applied to Heparin^ 
Sepharose CL-4B column. The peak DNA polymerase 
was eluted with a linear gradient of 100-800 mM 
KCl in Buffer A, concentrated and finally dialyzed 
in buffer A containg 50% glycerol. 

The resulting modified Bst DNA polymerase 
has been proved to be homogenous by polyacryl amide 
gel electrophoresis. And the enzyme obtained was 
stored in -20°C. 



Example^P^ Determination of the thermostability 
of unmodified Bst DNA polymerase and modified Bst 
DNA polymerase 



The DNA polymerases of Examples 1 and 5 
were incubated at 65°C for 0, 5, 10, 20, 30, 40, 
50 minutes respectively, and placed into ice-water 
immediately. The polymerase activity of these DNA 
polymerases was determined at 60 °C. 

The polymerase activity of DNA 
polymerase was determined as follows: 
5 x Reaction Solution: 
1M Tris-HCl (pH7.6) 16.75 ml 

1M MgCl 2 1.675 ml 

1M j8-Mercaptoethanol 0.25 ml 

ddH 2 0 adjusted to 50 ml 

Reaction Storage: 

5 x Reaction Solution 60/zl 

dNTPs (ImM each) 10/il 
l-5/xg//xl DNase I activated 

calf thymus DNA 10/il 

ddH 2 0 10 til 

Qf- 32 P-dATP appropriate amt . 



Reaction Mixture: 

Reaction Storage 30/xl 

Sample 5/zl 

ddH 2 0 y 65/zl 

The reaction mixtures were prepared as 
per the recipe above, and incubated at 60 °C for 3 0 
minutes. Then the reaction mixtures were pipetted 
onto DE-81 filters respectively. After all of the 
fluid has evaporated, the amount of radioactivity 
on each filter was measured with scintillation 



(X,) . ilters were washed three with 

0.3M Na 2 HP0 4 at room temperature, 10 minutes each 
times, dried at room temperature and then the 
amount of radioactivity on each filter was 
measured again (X 2 ) . 

The polymerase activity of sample 

(u/ml)= 

X 2 — X2 0 x 2 6 6 



X 1 X 



10 



(X 10 and X 20 are the amount of radioactivity 
measured with water as control sample) 

Unit definition of polymerase activity: 
One unit is the amount of DNA polymerase required 
to incorporate 10 nanomoles of dNTPs into DNA in 
30 minutes at 60°C . 

The thermostability of DNA polymerase is 
expressed with the half life of polymerase 
activity at 65 °C. Figure 1 shows the comparison of 
thermostabilty of HiFi Bst and HiFi Bst-II. The 
half life of HiFi Bst at 65°C was 8.5 minutes, and 
that of HiFi Bst-II was 16 minutes. HiFi Bst-II 
was more thermostable than HiFi Bst. 



Example 6: Demonstration of increased ddNTP 
incorporation by modified Bst DNA polymerase in 
suboptimal sequencing conditions 

The following procedure was followed: 

1. The -20M13 forward primer was 
radiolabelled using y- 32 P-ATP and T4 Polynucleotide 
kinase ; 

2. The following components were combined in 
a microcentrifuge tube: 

5xReaction Buffer 2.0/zl 
radiolabeled primer 1 . 0/zl (2.5ng) 



Templat^^ 7.0/xl (1/xg M13mpl^^DNA) 

The final volume was 10/zl. The contents were mixed 
and spun for 2-3 seconds; 



3. The tube were placed in a 75°C water bath 
for 5 minutes. Then the tube was allowed to cool 
slowly to ambient temperature over a course of 10 
minutes; 



4. 1.0/xl of modified Bst DNA polymerase (of 
Example 5) (lu/fil) was added. The mixture was 
mixed gently and spun for 2-3 seconds; 

5. 4 tubes were labelled U A" , "C", "G", "T", 
respectively and 2/il of each premixed nucleotide 
solution and 2.5/xl of main mixture (from step 3) 
was added to the respective reaction 

tube ; 



6. The tubes were incubated at 65°C for 15 
minutes ; 



7. The reactions were stopped by adding 4.0jxl 
of Stop Solution (95% deionized formamide, lOmM 
EDTA, 0.05% xylene cyanol FF, 0.05% bromophenol 
blue) to each tube; 



8. The samples were denatured at 90°C for 2 
minutes, and immediately placed on ice; 

9. 4-5/xl of samples were loaded onto each 
lane of 6% (8M urea) sequencing gel, and 
electrophoresis was carried out. 

Note: The Components of the Premixed Nucleotide 
Solutions : 

A mix: dNTPs 120/zM, ddATP 4 0/xM 
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C liUJF: dNTPs 120/zM, ddCTP 4 0/zM 
G mix: dNTPs 120/zM, ddGTP 40/zM 
T mix: dNTPs 12 0/iM, ddTTP 4 0/iM 



Figure 2 shows the comparison of ddNTP 
incorporation of HiFi Bst-II DNA polymerase and 
HiFi Bst DNA polymerase. In this radiolabeling 
DNA sequencing experiment, high concentrations of 
nucleotides were used in the reaction mixture and 
the ddNTP/dNTP ratio was reduced to a level (1/3) 
that is lower than the optimal range for DNA 
sequencing. HiFi Bst-II is shown to have more 
effective ddNTP incorporation. The DNA synthesis 
was often terminated by ddNTP incorporation in the 
HiFi Bst-II mixture, and the result showed uniform 
bands with synthesized small or large DNA 
fragments. As a contrast, HiFi Bst had a lower 
ddNTP incorporation. The DNA synthesis by HiFi Bst 
was less terminated, and most of the synthesized 
products were the larger DNA fragments. 

Example 7: Preparation of Denatured Double- 
stranded DNA Template: 

The following procedure was carried out. 

1. Double -stranded DNA (about 3-5^g) was 
adjusted to a final volume of 10//1 with TE (lOmM 
Tris-HCl, ImM EDTA, pH8.0); 

2. 10/xl of 0.4N NaOH, 0 . 4mM EDTA, was added; 

3. The mixture was incubated at 65°C for 15 
minutes; 

4. 2/il of 2M sodium acetate, pH4.5, and 55/zl 
cold ethanol was added, and the mixture was placed 



in ice-Wcffer bath for 5 minutes; 



5. The mixture was spun in a microcentrifuge 
at 4°C, 12500rpm for 5 minutes; 

6. The supernatant was drawn off and the 
pellet was washed with 200/*l of 70% ethanol; 

7. The pellet was dried under vacuum for 2-3 
minutes, and the DNA was dissolved in appropriate 
solution. 

Example 8: DNA sequencing using unmodified Bst 
DNA polymerase/modif ied Bst DNA polymerase with 
radiolabeled dATP for single- or denatured double- 
stranded DNA template 

The following procedure was carried out. 

1. The following components were combined in 
a labeled microcentrifuge tube: 

BxReaction Buffer 2.0^1 

Primer 1.0/xl (2.5-5.0ng) 

Template 7.0/zl (250-500ng ss DNA 

or 1-3/xg denatured ds DNA) 
The final volume was 10/xl. The contents were mixed 
and spun for 2-3 seconds ; 

2. The tube were placed in a 75°C water bath 
for 5 minutes. Then the tubes were allowed to cool 
slowly to ambi N ent temperature over a course of 10 
minutes; 

(Note: Step 2 is optional for single-stranded 
template, and may be omitted at appropriate.) 

3. 1.0/xl of HiFi Bst/HiFi Bst-II (lu///l) and 
1.0/il of [of-32P]dATP was added, and the mixture 
was mixed gently and spun for 2-3 seconds; 



4 . ^^Ftubes "A", "C", "G" , "T" were labelled, 
and 2/xl of each premixed nucleotide solution and 
2.5/il of main mixture (from step 3) was added to 
the respective reaction tube; 

5. The tubes were incubated at 65 °C for 2 
minutes; 

6. 2.0/xl of 0.5mM dNTPs was added to each 
tube, and the tubes were mixed gently, spun for 2- 
3 seconds, and incubated at 65 °C for 2 minutes; 

7. The reactions were stopped by adding 4.0^1 
of Stop Solution (95% deionized formamide, lOmM 
EDTA, 0.05% xylene cyanol FF, 0.05% bromophenol 
blue) to each tube; 

8. The samples were denatured at 90°C for 2 
minutes, and immediately placed on ice; 

9. 2-3/xl of the samples were loaded onto each 
lane of 6% (8M urea) sequencing gel, and 
electrophoresis was carried out. 

Note: The Components of the Premixed Nucleotide 
Solutions for HiFi Bst : 

A mix: dATP0.62/zM, dCTP 62/xM, dGTP 62 /iM, dTTP 
62 iM, ddATP 25/xM; 

C mix: dATP 0.8/iM, dCTP 8/zM, dGTP 80/xM, dTTP 
80/xM, ddCTP SQ/xM; 

G mix: dATP 0.8/zM, dCTP 80/iM, dGTP 4/zM, dTTP 
80^M, ddGTP 75/xM; 

T mix: dATP 0.8/zM, dCTP 80/xM, dGTP 80/xM, dTTP 
8/iM, ddTTP 150/zM. 

The Components of the Premixed Nucleotide 
Solutions for HiFi Bst-II: 



A fflx: dATP 0.8/iM, dCTP 80//M, dGTP 80/xM, dTTP 
80/zM, ddATP 25/zM; 

C mix: dATP 0.8/zM, dCTP 8/xM, dGTP 80/zM, dTTP 
80/zM, ddCTP 2 0/zM; 

G mix: dATP 0.8/zM, dCTP 80/xM, dGTP 8/zM, dTTP 
80/iM, ddGTP 50/xM; 

T mix: dATP 0.8/zM, dCTP 80 (iM, dGTP 80/iM, dTTP 
8/iM, ddTTP 50/zM. 

Figure 3 shows the comparison of 
radiolabeled DNA sequencing with HiFi Bst and HiFi 
Bst-II in their respective optimized reaction 
mixtures. The bands on a DNA sequencing gel with 
HiFi Bst-II were more uniform than those with HiFi 
Bst. The sequence pattern using HiFi Bst-II was 
better than that using HiFi Bst. This made the gel 
with HiFi Bst-II even easier to read. Moreover, 
the ratio of dideoxy-nucleotide concentration to 
deoxy-nucleotide concentration in Premixed 
Nucleotide Solutions for HiFi Bst-II was lower 
than that for HiFi Bst. For instance, the 
ddATP/ dATP , ddCTP/dCTP, ddGTP/ dGTP and ddTTP/dTTP 
ratios were 40, 6.25, 18.75 and 18.75, 
respectively, for HiFi Bst. The corresponding 
ratios for HiFi Bst-II were 40, 2.5, 6.25 and 
6.25, respectively. Therefore, the concentrations 
of ddNTPs used in the termination reaction are 
reduced to about three -folds after HiFi Bst has 
been genetically modified. 

Example 9: DNA sequencing using unmodified Bst 
DNA polymerase/modif ied Bst DNA polymerase with 
dye -primers for single- or denatured double - 
stranded DNA template 

The following procedure was carried out. 



1. The following was combined: 4.0/il DNA 
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template^^00-600ng for ssDNA or 1-2/ic^denatured 
ds DNA) with 5 . 0/il 5xReaction Buffer. The mixture 
was mixed and spun for 2-3 seconds in a 
microcentrifuge; 



2. 4 tubes were labeled U A" , "C", "G", "T" 
and the pre -mixed dATP, dCTP, dGTP, dTTP reagents 
were added to each reaction tube: 

Reaction tube 
A C G T 

A-REG primer ( 0 . 2uM) 1 . Oul 
C-FAM primer (0.2uM) 1 . Oul 

G-TMR primer (0.4uM) 1 . Oul 

T-ROX primer (0.4uM) 1 . Oul 

A terminator mix 2 . Oul 

C terminator mix 2 . Oul 

G terminator mix 2 . Oul 

T terminator mix 2 . Oul 

5 x Reaction buffer 

with DNA template 2 . Oul 2 . Oul 2 . Oul 2 . Oul 

Total volume 5 . Oul 5 . Oul 5 . Oul 5 . Oul 

3. The tubes were placed in 75°C bath for 5 
minutes, and allowed to cool slowly to ambient 
temperature over the course of 10 minutes; 
(Note: Step 3 is optional for single-stranded 
template, and may be omitted as appropriate.) 

4. 1/il of v HiFi Bst/HiFi Bst-II (0.5u//zl) was 
added to each tube, and the tubes were spun for 2- 
3 seconds ; 

5. The tubes were incubated at 65°C for 5 
minutes ; 



6. The contents of the "A", W C" , "G" and "T" 




tubes were pooled, and 1.5/xl of 7 . 5M ammonium 
acetate and 55/zl of ethanol was added. The 
mixtured was mixed in a vortex and then placed on 
ice for 2 0 minutes; 

7. The mixture was centrifuged at 12 / 500rpm 
for 20 minutes at 4°C; 

8. The supernatant was drawn off, and the 
pellet was washed with 2 00/il of 70% ethanol; 

9. The pellet was vacuum dried for 2-3 
minutes, and resuspended in 4/zl of loading buffer 
(5:1 deionized formamide : 25mM EDTA with 50mg/ml 
Blue Dextran) ; 

10. The sample was heated at 75 °C for 2-3 
minutes, and immediately placed on ice; 

11. 2-3 /il of sample was loaded onto a lane 
of the 4% (6M urea) sequencing gel, and ABI PRISM™ 
377 DNA Sequencer (from Perkin Elmer) was used to 
collect data. 

Note: Dye primer: DYEnamic Energy Transfer Dye 
Primers (from Amersham) : 

-21 M13 forward: 5' -FAM-S T SSSSSTGT*AAAACGACGGCCAGT- 
3' (SEQ ID NO: 11) 
TS=1 ' 2 ' -dideoxyribose 

T*=T attached with Dye 2 (A-REG, C-FAM, G-TMR, T- 
ROX) 

Figure 4 and Figure 5 show the results 
of dye -primer DNA sequencing with HiFi Bst and 
HiFi Bst-II. Both DNA polymerases generated 
similar sequencing results although the peaks on 
the color plot by HiFi Bst II appear to be more 
even in height. 
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Example^): DNA sequencing using unmodified Bst 
DNA polymerase/modif ied Bst DNA polymerase with 
dye-terminators for single- or denatured double- 
stranded DNA template 

The following procedure was carried out 



1- The following components were combined in 
a labeled microcentrifuge tube: 

BxReaction Buffer 4 . 0/zl 
Template 8.0/xl (2-3/zg ss DNA 

or 4- 6 fig denatured 

ds DNA) 

Primer 2.0/zl (5-10ng) 

The final volume was 14/il. The contents were 
mixed and spun for 2-3 seconds; 

2. The tube was placed in a 75 °C water bath 
for 5 minutes; 



3. The tube was allowed to cool slowly to 
ambient temperature over a course of 10 minutes ; 
(Note: Steps 2 and 3 are optional for single- 
stranded template, and may be omitted as 
appropriate . ) 

4. 1.0/zl of HiFi Bst/HiFi Bst-II (l-2u//il) , 
5/il of nucleotides premix (containing Perkin 
Elmer-ABI fluorescent dye-labeled nucleotide 
terminators) , were added and the tube was spun for 
2-3 seconds; 

5. The mixture was incubated at 65°C for 10 
minutes ; 



6. 80/il of H 2 0 was added to the reaction mix, 
and the dye terminators were extracted with 100/xl 
of phenol :H 2 0: chloroform (68:18:14) reagent twice. 
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The sampl^was vortexed and centrif uged, and the 
aqueous upper layer was transferred to a clean 
tube ; 



7. To the tube was added 15/il of 2M sodium 
acetate, pH 4.5, and 3 00//1 of ethanol, and the 
tube was vortexed and placed in ice -water bath for 
20 minutes ; 

8. The tube was centrif uged with 12,500rpm 
for 20 minutes at 4°C; 

9. The supernatant was drawn off, and the 
pellet was washed with 200jzl of 70% ethanol; 

10. The pellet was vacuum dried for 2-3 
minutes, and resuspended in 4/zl of loading buffer 
(5:1 deionized formamide : 25mM EDTA with 50mg/ml 
Blue Dextran) ; 

11. The sampled was heated at 90 °C for 2-3 
minutes, and immediately placed on ice; 

12. 2-3/il of sample was loaded onto a lane of 
the 4% (6M urea) sequencing gel, and ABI PRISM™ 
377 DNA Sequencer (from Perkin Elmer) was employed 
to collect data, using appropriate amounts of 
nucleotide pre-mixed reagents. 

Figure 6 and Figure 7 show the results of 
dye -terminator DNA sequencing with HiFi Bst and 
HiFi Bst-II. There was data lost in dye -terminator 
DNA sequencing with HiFi Bst, especially the "C" 
after "G" or n A" and "A" after "G" . In Figure 8, 
corrections of the missing or ambiguous bases, 
according to the known pGEM sequence, have been 
indicated below the letters "N" or below the 



incorrect base letters. This problem caused 
ambiguity in DNA sequencing. But it was resolved 
in dye -terminator DNA sequencing with the modified 
Bst DNA polymerase of this invention . 
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